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WEIGHTED RANDOM SCHEDULING PARTICULARLY APPLICABLE TO 

PACKET SWITCHING SYSTEMS 



5 TECHNICAL FIELD 

This invention especially relates to communications and computer systems; and 
more particularly, the invention relates to weighted random scheduling particularly 
applicable to packet switching systems. 



10 

BACKGROUND 

The communications industry is rapidly changing to adjust to emerging 
technologies and ever increasing customer demand. This customer demand for new 
applications and increased performance of existing applications is driving 

1 5 communications network and system providers to employ networks and systems having 
greater speed and capacity (e.g., greater bandwidth). In trying to achieve these goals, a 
common approach taken by many communications providers is to use packet switching 
technology. Increasingly, public and private communications networks are being built and 
expanded using various packet technologies, such as Internet Protocol (IP). Note, nothing 

20 described or referenced in this document is admitted as prior art to this application unless 
explicitly so stated. 

A network device, such as a switch or router, typically receives, processes, and 
forwards or discards a packet. There are many different scheduling algorithms that 
achieve one hundred percent throughput within a centrally scheduled, inputs queued 
25 switch. Input queued switches typically store packets destined for different destinations in 
a single queue which can lead to blocking conditions. To avoid this head of queue 
blocking, many network devices have at each input multiple virtual output queues 
(VoQs), with one or more queues corresponding to each output of the respective network 
device. This architecture requires the scheduling of the sending of packets from these 
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multiple VoQs (in contrast to selecting the packet at the head of the single input queue). 
Prior systems use a combination of virtual output queues (VoQs) and scheduling 
algorithms based on maximal weight matching. However, the practical implementations 
of these scheduling algorithms are complex and expensive to implement in hardware at 
high speed, and is becoming more problematic as line rates continue to increase. 

For example, SLIP is an algorithm for scheduling the sending of packets across an 
Nx N switch. In one implementation, the following three steps are performed: 

1 . Each unmatched input sends a request to every output for which it has a queued cell. 

2. If an unmatched output receives any requests, it chooses the one that appears next in a 
fixed, round-robin schedule starting from the highest selection priority element. The 
output notifies each input whether or not its request was granted. The pointer to the 
highest selection priority element of the round-robin schedule is incremented (modulo 
N) to one location beyond the granted input if and only if the grant is accepted in step 
3 of the first iteration. The pointer is not incremented in subsequent iterations. 

3. If an input receives a grant, it accepts the one that appears next in a fixed, round-robin 
schedule starting from the highest selection priority element. The pointer to the 
highest selection priority element of the round-robin schedule is incremented (modulo 
N) to one location beyond the accepted output. 

I-SLEP is a scheduling algorithm including multiple iterations of the SLIP 
algorithm to determine the scheduling of packets for each round of sending packets 
(rather than just one SLIP iteration.) 

Each output scheduler decides among the set of ordered, competing requests using 
a rotating selection priority. When a requesting input is granted and the input accepts that 
grant, the input will have the lowest selection priority at that output in the next cell time. 
Also, whatever input has the highest selection priority at an output will continue to be 
granted during each successive time slot until it is serviced. This ensures that a 
connection will not be starved: the highest selection priority connection at an output will 
always be accepted by an input in no more than N cell times. 
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Moving the pointers not only prevents starvation, it tends to desynchronize the 
schedulers. Each of the outputs that matched in the previous time slot will have a 
different highest selection priority input. Thus, they will each grant to different inputs. 
Consider an example in which two inputs are both requesting the same two outputs. 
5 Initially, both outputs may grant to the same input; in that case only one connection will 
be made in the first iteration. 

The successful output will increment its pointer and in the next cell time, the 
outputs will no longer contend: one will have moved on to grant to another input and the 
other will grant to the same input as before. This leads to a better match in the first 
10 iteration of the next cell time. This is because the output schedulers have become 

desynchronized (or "slipped' 1 ) with respect to each other. This leads to high performance, 
even for a single iteration of SLIP. 

Because of the round-robin movement of the pointers, the algorithm tends to 
provide a fair allocation of bandwidth among competing connections and to be 
15 burst-reducing. The burst-reduction is simplest to understand under high load when all 
input queues are occupied: the algorithm will visit each competing connection in turn, so 
that even if a burst of cells for the same output arrives at the input, the burst will be 
spread out in time if there is competing traffic. 

An example implementation is described in Nicholas W. McKeown, "Method and 
20 Apparatus for Scheduling Cells in an Input-Queued Switch, U.S. Patent No. 5,500,858, 
issued March 19, 1996, which is hereby incorporated by reference. Another example 
implementation is described in Nicholas W. McKeown, "Combined Unicast and 
Multicast Scheduling," U.S. Patent No. 6,212,182, issued April 3, 2001, which is hereby 
incorporated by reference. 

25 
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SUMMARY 

Disclosed are, inter alia, methods, apparatus, data structures, computer-readable 
media, mechanisms, and means for weighted random scheduling particularly applicable to 
packet switching systems. One embodiment generates a request to send a packet to one of 
5 the outputs of the switch for each input of a switch by weighted randomly selecting one of 
the outputs to which the particular input has one or more packets to send. One of the 
requests is granted for each different one of the outputs for which one or more requests 
were generated. Packets are sent between the inputs and the output corresponding to the 
granted requests. 

10 In one embodiment, a round consisting of generating the request, granting one of 

the requests, and sending packets is performed at least two times per packet time, with a 
packet time corresponding to the amount of time allocated to receive a packet at an input 
from an external source. In one embodiment, the switch is configured to setup 
connections between the switch inputs and the outputs corresponding to the granted 

15 requests prior to the sending packets. In one embodiment, the granting one of the requests 
includes randomly selecting one of the requests for each of the outputs for which one or 
more of the requests were generated. One embodiment performs one or more rounds of 
generating requests by weighted randomly selecting among all or a subset of the outputs, 
and selectively granting the requests corresponding to available bandwidth after granting 

20 one of the requests and before sending packets. 

In one embodiment, the weighted random selection is weighted based on a 
number of packets to send to each of the outputs by a corresponding input of the inputs. 
In one embodiment, the weighted random selection is weighted based on a number of 
bytes to send to each of the outputs by a corresponding input of the inputs. In one 

25 embodiment, the weighted random selection is weighted based on the last times packets 
were sent from a corresponding input of the inputs to each of the outputs. In one 
embodiment, the weighted random selection is weighted based on classes of service 
associated with packets to send to each of the outputs by a corresponding input of the 
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inputs. In one embodiment, the weighted random selection is weighted based on at least 
two items from a group consisting of: (a) the number of packets to send to each of the 
outputs by a corresponding input of the inputs, (b) the number of bytes to send to each of 
the outputs by a corresponding input of the inputs, (c) the last times packets were sent 
5 from a corresponding input of the inputs to each of the outputs, and (d) classes of service 
associated with packets to send to each of the outputs by a corresponding input of the 
inputs. 



10 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The appended claims set forth the features of the invention with particularity. The 
invention, together with its advantages, may be best understood from the following 
detailed description taken in conjunction with the accompanying drawings of which: 
5 FIGs. 1 A-C and 2A-B are block diagrams of a few of an unlimited number of 

embodiments using weighted random scheduling of packets; 

FIG. 3 is a block diagram of a component used in one embodiment for performing 
weighted random scheduling; 

FIG. 4A is a diagram illustrating the generation of requests and grants as 
10 performed in one embodiment; 

FIG. 4B is a diagram illustrating the generation of requests and grants as 
performed in one embodiment; 

FIG. 5 A is a flow diagram of a process used in one embodiment for performing 
weighted random scheduling of packets including determining requests based on a 
15 weighted random selection of destinations of pending packets at a switch input; 

FIG. 5B is a flow diagram of a process used in one embodiment for determining 
grants in response to received requests; and 

FIG. 6 is a flow diagram of a process used in one embodiment for performing 
weighted random scheduling of packets. 

20 
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DETAILED DESCRIPTION 

Disclosed are, inter alia, methods, apparatus, data structures, computer-readable 
media, mechanisms, and means for weighted random scheduling particularly applicable to 
packet switching systems. 
5 Embodiments described herein include various elements and limitations, with no 

one element or limitation contemplated as being a critical element or limitation. Each of 
the claims individually recites an aspect of the invention in its entirety. Moreover, some 
embodiments described may include, but are not limited to, inter alia, systems, networks, 
integrated circuit chips, embedded processors, ASICs, methods, and computer-readable 

10 media containing instructions. One or multiple systems, devices, components, etc. may 
comprise one or more embodiments, which may include some elements or limitations of a 
claim being performed by the same or different systems, devices, components, etc. The 
embodiments described hereinafter embody various aspects and configurations within the 
scope and spirit of the invention, with the figures illustrating exemplary and non-limiting 

15 configurations. 

As used herein, the term "packet 11 refers to packets of all types or any other units 
of information or data, including, but not limited to, fixed length cells and variable length 
packets, each of which may or may not be divisible into smaller packets or cells. The term 
"packet" as used herein also refers to both the packet itself or a packet indication, such as, 

20 but not limited to all or part of a packet or packet header, a data structure value, pointer or 
index, or any other part or direct or indirect identification of a packet or information 
associated therewith. For example, often times a router operates on one or more fields of 
a packet, especially the header, so the body of the packet is often stored in a separate 
memory while the packet header is manipulated, and based on the results of the 

25 processing of the packet (i.e., the packet header in this example), the entire packet is 

forwarded or dropped, etc. Additionally, these packets may contain one or more types of 
information, including, but not limited to, voice, data, video, and audio information. The 
term "item" is used generically herein to refer to a packet or any other unit or piece of 
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information or data, a device, component, element, or any other entity. The phrases 
"processing a packet' 1 and "packet processing" typically refer to performing some steps or 
actions based on the packet contents (e.g., packet header or other fields), and such steps 
or action may or may not include modifying, storing, dropping, and/or forwarding the 
5 packet and/or associated data. 

The term "system" is used generically herein to describe any number of 
components, elements, sub-systems, devices, packet switch elements, packet switches, 
routers, networks, computer and/or communication devices or mechanisms, or 
combinations of components thereof. The term "computer" is used generically herein to 

10 describe any number of computers, including, but not limited to personal computers, 
embedded processing elements and systems, control logic, ASICs, chips, workstations, 
mainframes, etc. The term "processing element" is used generically herein to describe any 
type of processing mechanism or device, such as a processor, ASIC, field programmable 
gate array, computer, etc. The term "device" is used generically herein to describe any 

15 type of mechanism, including a computer or system or component thereof. The terms 

"task" and "process" are used generically herein to describe any type of running program, 
including, but not limited to a computer process, task, thread, executing application, 
operating system, user process, device driver, native code, machine or other language, 
etc., and can be interactive and/or non-interactive, executing locally and/or remotely, 

20 executing in foreground and/or background, executing in the user and/or operating system 
address spaces, a routine of a library and/or standalone application, and is not limited to 
any particular memory partitioning technique. The steps, connections, and processing of 
signals and information illustrated in the figures, including, but not limited to any block 
and flow diagrams and message sequence charts, may typically be performed in the same 

25 or in a different serial or parallel ordering and/or by different components and/or 

processes, threads, etc., and/or over different connections and be combined with other 
functions in other embodiments, unless this disables the embodiment or a sequence is 
explicitly or implicitly required (e.g., for a sequence of read the value, process 
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the value - the value must be obtained prior to processing it, although some of the 
associated processing may be performed prior to, concurrently with, and/or after the read 
operation). Furthermore, the term "identify" is used generically to describe any manner or 
mechanism for directly or indirectly ascertaining something, which may include, but is 
5 not limited to receiving, retrieving from memory, determining, defining, calculating, 
generating, etc. 

Moreover, the terms "network" and "communications mechanism" are used 
generically herein to describe one or more networks, communications media or 
communications systems, including, but not limited to the Internet, private or public 

10 telephone, cellular, wireless, satellite, cable, local area, metropolitan area and/or wide 
area networks, a cable, electrical connection, bus, etc., and internal communications 
mechanisms such as message passing, interprocess communications, shared memory, etc. 
The term "message" is used generically herein to describe a piece of information which 
may or may not be, but is typically communicated via one or more communication 

1 5 mechanisms of any type. 

The term "storage mechanism" includes any type of memory, storage device or 
other mechanism for maintaining instructions or data in any format. "Computer-readable 
medium" is an extensible term including any memory, storage device, storage 
mechanism, and other storage and signaling mechanisms including interfaces and devices 

20 such as network interface cards and buffers therein, as well as any communications 

devices and signals received and transmitted, and other current and evolving technologies 
that a computerized system can interpret, receive, and/or transmit. The term "memory" 
includes any random access memory (RAM), read only memory (ROM), flash memory, 
integrated circuits, and/or other memory components or elements. The term "storage 

25 device" includes any solid state storage media, disk drives, diskettes, networked services, 
tape drives, and other storage devices. Memories and storage devices may store 
computer-executable instructions to be executed by a processing element and/or control 
logic, and data which is manipulated by a processing element and/or control logic. The 
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term "data structure" is an extensible term referring to any data element, variable, data 
structure, database, and/or one or more organizational schemes that can be applied to data 
to facilitate interpreting the data or performing operations on it, such as, but not limited to 
memory locations or devices, sets, queues, trees, heaps, lists, linked lists, arrays, tables, 
5 pointers, etc. A data structure is typically maintained in a storage mechanism. The terms 
"pointer" and "link" are used generically herein to identify some mechanism for 
referencing or identifying another element, component, or other entity, and these may 
include, but are not limited to a reference to a memory or other storage mechanism or 
location therein, an index in a data structure, a value, etc. 

10 The term "one embodiment" is used herein to reference a particular embodiment, 

wherein each reference to "one embodiment" may refer to a different embodiment, and 
the use of the term repeatedly herein in describing associated features, elements and/or 
limitations does not establish a cumulative set of associated features, elements and/or 
limitations that each and every embodiment must include, although an embodiment 

15 typically may include all these features, elements and/or limitations. In addition, the 
phrase "means for xxx" typically includes computer-readable medium containing 
computer-executable instructions for performing xxx. 

In addition, the terms "first," "second," etc. are typically used herein to denote 
different units (e.g., a first element, a second element). The use of these terms herein does 

20 not necessarily connote an ordering such as one unit or event occurring or coming before 
another, but rather provides a mechanism to distinguish between particular units. 
Additionally, the use of a singular tense of a noun is non-limiting, with its use typically 
including one or more of the particular thing rather than just one (e.g., the use of the word 
"memory" typically refers to one or more memories without having to specify "memory 

25 or memories," or "one or more memories" or "at least one memory", etc.). Moreover, the 
phrases "based on x" and "in response to x" are used to indicate a minimum set of items x 
from which something is derived or caused, wherein "x" is extensible and does not 
necessarily describe a complete list of items on which the operation is performed, etc. 

10 
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Additionally, the phrase "coupled to" is used to indicate some level of direct or indirect 
connection between two elements or devices, with the coupling device or devices 
modifying or not modifying the coupled signal or communicated information. The term 
"subset" is used to indicate a group of all or less than all of the elements of a set. The term 
5 "subtree" is used to indicate all or less than all of a tree. Moreover, the term "or" is used 
herein to identify a selection of one or more, including all, of the conjunctive items. 

Disclosed are, inter alia, methods, apparatus, data structures, computer-readable 
media, mechanisms, and means for weighted random scheduling particularly applicable to 
packet switching systems. One embodiment generates a request to send a packet to one of 
10 the outputs of the switch for each input of a switch by weighted randomly selecting one of 
the outputs to which the particular input has one or more packets to send. One of the 
requests is granted for each different one of the outputs for which one or more requests 
were generated. Packets are sent between the inputs and the output corresponding to the 
granted requests. 

15 In one embodiment, a round consisting of generating the request, granting one of 

the requests, and sending packets is performed at least two times per packet time, with a 
packet time corresponding to the amount of time allocated to receive a packet at an input 
from an external source. In one embodiment, the switch is configured to setup 
connections between the switch inputs and the outputs corresponding to the granted 

20 requests prior to the sending packets. In one embodiment, the granting one of the requests 
includes randomly selecting one of the requests for each of the outputs for which one or 
more of the requests were generated. One embodiment performs one or more rounds of 
generating requests by weighted randomly selecting among all or a subset of the outputs, 
and selectively granting the requests corresponding to available bandwidth after granting 

25 one of the requests and before sending packets. 

In one embodiment, the weighted random selection is weighted based on a 
number of packets to send to each of the outputs by a corresponding input of the inputs. 
In one embodiment, the weighted random selection is weighted based on a number of 

11 
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bytes to send to each of the outputs by a corresponding input of the inputs. In one 
embodiment, the weighted random selection is weighted based on the last times packets 
were sent from a corresponding input of the inputs to each of the outputs. In one 
embodiment, the weighted random selection is weighted based on classes of service 
5 associated with packets to send to each of the outputs by a corresponding input of the 
inputs. In one embodiment, the weighted random selection is weighted based on at least 
two items from a group consisting of: (a) the number of packets to send to each of the 
outputs by a corresponding input of the inputs, (b) the number of bytes to send to each of 
the outputs by a corresponding input of the inputs, (c) the last times packets were sent 
10 from a corresponding input of the inputs to each of the outputs, and (d) classes of service 
associated with packets to send to each of the outputs by a corresponding input of the 
inputs. 

One embodiment includes multiple request generators and one or more grant 
arbiters coupled to the request generators. Each of the request generators is configured to 

15 weighted randomly generate a request for its associated input of multiple inputs of a 
switch, the request including a weighted random selection of one of the outputs of the 
switch. One or more grant arbiters are configured to grant requests received from the 
request generators such that one request for each requested output is granted. The inputs 
are configured to send packets corresponding to said granted requests. 

20 One embodiment includes a control to configure the switch to setup connections 

between said inputs and said outputs corresponding to said granted requests. In one 
embodiment, one or more grant arbiters randomly select a request to grant for a particular 
output from said requests received for the particular output. In one embodiment, inputs 
are configured to send packets from their respective virtual output queues corresponding 

25 to the granted requests. One embodiment includes multiple line cards and a switching 
board including the switch. In one embodiment, the virtual output queues are located on 
the line cards. In one embodiment, the virtual output queues are located on the switching 
board. 
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In one embodiment, the weighted random selection is weighted based on a 
number of packets or bytes to send to each of the outputs by a corresponding input of the 
inputs. In one embodiment, the weighted random selection is weighted based on the last 
times packets were sent from a corresponding input of the inputs to each of the outputs. In 
5 one embodiment, the weighted random selection is weighted based on classes of service 
associated with packets to send to each of the outputs by a corresponding input of the 
inputs. In one embodiment, the weighted random selection is weighted on at least two 
items from a group consisting of: (a) the number of packets to send to each of the outputs 
by a corresponding input, (b) the number of bytes to send to each of the outputs by a 

10 corresponding input, (c) the last times packets were sent from a corresponding input to 
each of the outputs, and (d) the classes of service associated with packets to send to each 
of the outputs by a corresponding input. 

One embodiment, for each particular input of a inputs, generates a first request 
including weighted randomly selecting one of the outputs. One of the first requests for 

15 each different one of the outputs for which one or more of said first requests was 

generated is granted. For each particular input of the inputs whose first request was not 
granted, a second request is generated, which includes weighted randomly selecting one 
of the outputs (possibly limited to those not already have a request previously granted). 
One of said second requests is granted for each different one of the outputs not already 

20 having a corresponding first request granted and for which one or more of said second 
requests was generated. 

In one embodiment, the weighted random selection is weighted based on the 
number of packets or bytes to send to each of the outputs by the corresponding input. In 
one embodiment, packets are sent between said inputs and said outputs based said granted 

25 first and second requests. In one embodiment, only two request phases are performed per 
packet time, said two request phases consisting of said generating said first requests and 
said generating said second requests. In one embodiment, each particular request to grant 
of said first requests and said second requests for each particular output is randomly 

13 
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selected from all the respective said first requests or said second request for said 
particular output. 

One embodiment includes multiple request generators and one or more grant 
arbiters coupled to the request generators. Each of the request generators is configured to 
5 weighted randomly generate a first round request and as required a second round request 
for its associated input of a inputs of a switch, with the first round request including a 
first weighted random selection of one of a outputs and the second round request 
including a second weighted random selection of one of the outputs if the first round 
request was not granted. The grant arbiters are configured to grant requests received from 

10 the request generators such that one request for each requested output in the first round of 
requests is granted, and one request for each requested output in the second round of 
requests is granted if a grant for said requested output was not generated for a request in 
the first round of requests. 

In one embodiment, the one or more grant arbiters randomly select a request to 

15 grant for a output from the set of requests received for the output. In one embodiment, the 
weighted random generation is weighted based on the number of packets or bytes to send 
to each of the outputs by the corresponding input. 

One embodiment randomly picks one of the candidate virtual output queues / 
destination for each switch input, where the random selection is weighted by the length of 

20 each of the queues for the input. Scheduling requests are then sent to the chosen 

destination. Each destination, after getting requests from all the candidate inputs, sends a 
grant back to the randomly chosen input. Inputs that get a grant back from the destination, 
will send a corresponding packet in the corresponding packet time. The two stage random 
selection process represents a single phase of the weighted random selection process. A 

25 second phase is performed for those switch inputs that have not received a grant and for 
those destinations for which a grant was not generated in the first phase (i.e., no request 
was generated during the first phase). In this manner, a minimal amount of scheduling 
overhead is incurred, and desired traffic throughput is received when these two 

14 
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scheduling phases are performed in the time allotted to send one or more packets through 
the switch. One embodiment performs more than two scheduling phases in the time 
allotted to send one or more packets through the switch. 

FIGs. 1 A-C and 2A-B are block diagrams of a few of an unlimited number of 
5 embodiments using weighted random scheduling of packets. Turning first to FIG. 1 A, 
illustrated is a system 100 including a switching mechanism (e.g., a physical switching 
unit, a switching fabric, a crossbar, bus, or any other mechanism for switching packets, 
etc.) 102, a control with scheduler and memory 101, and multiple line cards 103-104. 
System 100 schedules packets being sent among line cards 103-104 using weighted 

10 random scheduling. Note, weighted random scheduling can be used in conjunction with a 
blocking or a non-blocking switch. 

FIG. IB illustrates a system 1 10 that uses weighted random scheduling of packets. 
Line cards 111-112 each contain virtual output queues in this exemplary embodiment, 
and switching board 120 includes a control with scheduler and memory 121, multiple 

15 buffers 122-123, and a switching mechanism 125. Line cards 1 11-1 12 generate the 

scheduling requests based on a weighted random selection. Packets are then sent to input 
buffers 122-123 according to the determined weighted random scheduling, with the 
particular weight applied being typically based on the occupancies of the virtual output 
queues and/or characteristics of the packets (e.g., number of packets or bytes, age of 

20 packets, quality of service, etc.). Note, in one embodiment, input buffers 122-123 can 
each store multiple packets, and thus previously weighted random scheduled packets can 
be queued while the scheduling is performed for subsequent packet times. Control with 
scheduler and memory 121 typically generates the grants based on the received 
scheduling requests, and controls the configuration of switching mechanism 125 so as to 

25 allow the scheduled packets to be sent through it. 

FIG. 1C illustrates a system 140 that uses weighted random scheduling of packets. 
Line cards 141-142 communicate packets with switching board 150 in this exemplary 
embodiment, and switching board 150 includes a control with scheduler and memory 

15 
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151, virtual output queues 152-153, and a switching mechanism 155. In this embodiment, 
the weighted random scheduling of packets is performed by control with scheduler and 
memory 151, with the particular weight applied being typically based on the occupancies 
of the virtual output queues 152-153 and/or characteristics of the packets (e.g., number of 
5 packets or bytes, age of packets, quality of service, etc.). Packets are then sent from 

virtual output queues 152-153 according to the determined weighted random scheduling, 
with the configuration of switching mechanism 155. 

FIG. 2A illustrates a system 200 that uses weighted random scheduling of packets. 
Packets 201 are received and placed in packet queues 202. The queue occupancies 21 1 

10 are communicated to the weighted random request generator 210 for each switch input 
(also referred to herein as a switch input), which also communicated with grant 
generator(s) 220 (e.g., one grant generator, one grant generator for each destination, etc.). 
Grant generator(s) 220 communicate the determined switch configuration 221 to the 
switching mechanism 225 based on the generated grants 217, and the weighted random 

15 request generators 210 provide send instructions to packet queues 202 to send packets 
corresponding to the weighted random scheduled packets. Packets 203 are sent from 
packet queues 202, through the accordingly configured switching mechanism 204 to 
produce the switched packets 205. 

FIG. 2B illustrates a system 230 that uses weighted random scheduling of packets. 

20 Packets 231 are received and placed in packet queues 232. The queue occupancies 241 
are communicated to the weighted random generator 210 for each switching mechanism 
input. Weighted random request and grant generator 250 uses one or more processes 
and/or customized control logic to perform the weighted random scheduling of packets, 
and to inform packet queues 232 of the packets to send (243) and switching mechanism 

25 234 of the corresponding switch configuration 251. Packets 213 are sent from packet 

queues 232, through the accordingly configured switching mechanism 234 to produce the 
switched packets 235. 
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FIG. 3 is a block diagram of a component 300 used in one embodiment for 
performing weighted random scheduling, such as for generating weighted random 
requests and/or grants. In one embodiment, component 300 performs one or more 
processes corresponding to one of the flow diagrams illustrated or otherwise described 
5 herein. 

In one embodiment, system or component 300 includes a processing element 301, 
memory 302, storage devices 303, and an interface 304 for sending and receiving packets, 
items, and/or other information (e.g., queue occupancy, instructions to send packets, 
switch configuration to a communicatively coupled switch, etc.), which are typically 
10 coupled via one or more communications mechanisms 309 (shown as a bus for 

illustrative purposes.) In one embodiment, memory 302 and/or storage devices 303 are 
used to store packets on which the weighted random scheduling of the packets is 
performed. 

Various embodiments of component 300 may include more or less elements. The 
15 operation of component 300 is typically controlled by processing element 301 using 

memory 302 and storage devices 303 to perform one or more tasks or processes. Memory 
302 is one type of computer-readable media, and typically comprises random access 
memory (RAM), read only memory (ROM), flash memory, integrated circuits, and/or 
other memory components. Memory 302 typically stores computer-executable 
20 instructions to be executed by processing element 301 and/or data which is manipulated 
by processing element 301 for implementing functionality in accordance with an 
embodiment. Storage devices 303 are another type of computer-readable media, and 
typically comprise solid state storage media, disk drives, diskettes, networked services, 
tape drives, and other storage devices. Storage devices 303 typically store 
25 computer-executable instructions to be executed by processing element 301 and/or data 
which is manipulated by processing element 301 for implementing functionality in 
accordance with an embodiment. 
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FIG. 4A is a diagram illustrating the generation of requests and grants used in one 
embodiment. Weighted random request generators 400 determine a request for each 
switch input as indicated by processing 411. This selection is weighted based on the 
occupancies of the virtual output queues and/or characteristics of the packets (e.g., 
5 number of packets or bytes, age of packets, quality of service i.e., higher priority traffic is 
given greater weight, etc.). 

For example, assume there are four switch outputs, with eight packets queued at 
one input with one packet queued for the first output, two packets queued for the second 
output, one packet queued for the third output, and four packets queued for the fourth 

1 0 output. One embodiment uses a direct weighted proportion of the number of queued 
packets, while one embodiment uses different proportion calculation based on the 
occupancy of the virtual output queues. In this example, one embodiment uses a weight 
of one-eighth to send a packet from the queue corresponding to switch output/destination 
one; a weight of one-quarter to send a packet from the queue corresponding to switch 

15 output/destination two; a weight of one-half to send a packet from the queue 

corresponding to switch output/destination three; and a weight of one-eighth to send a 
packet from the queue corresponding to switch output/destination four. Then, a random 
number (i.e., a true random number, a pseudo random number, or any approximated 
random value) is generated with a corresponding destination four selected four times as 

20 likely as destinations one or three, and twice as likely as destination two. There are many 
techniques for determining this weighted selection. For example, one embodiment maps 
the percentages for each of the destination to a cumulative fractional value between zero 
and one, and a random value between zero and one is generated. In this example, 
destination one corresponds to 0.00 to 0.125, destination two corresponds to 0.126 to 

25 0.375, destination three corresponds to 0.376 to 0.500, and destination four corresponds 
to 0.501 to 1.000. 

Returning to FIG. 4A, the determined weighted random requests 412 for each 
switch input are communicated to grant generator(s) 402, which randomly or otherwise 
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select one of the requests for each destination for which a request was made (as illustrated 
by processing 413). These grants 414 are communicated to weighted random request 
generator(s) 400 (or in one embodiment to the control for sending packets and/or the 
control for configuring the switch). Packets are sent according to the grants as illustrated 
5 by reference number 43 1 . In one embodiment, typically one packet per grant is sent as 
that is the most that can be sent from one input to one output of a switch at one time. 
However, in one embodiment, multiple packets can be sent from an input and/or received 
by an output at a time, so the number of requests generated for an input and the number of 
grants generated for an output is accordingly adjusted. 

10 FIG. 4B is similar to FIG. 4 A with the addition used in one embodiment to 

perform multiple requests/grant rounds prior to sending packets corresponding to the 
generated grants. Weighted random request generators 400 determine a request for each 
switch input as indicated by processing 411. This selection is weighted based on the 
occupancies of the virtual output queues and/or characteristics of the packets (e.g., 

15 number of packets or bytes, age of packets, quality of service i.e., higher priority traffic is 
given greater weight, etc.). The determined weighted random requests 412 for each 
switch input are communicated to grant generator(s) 402, which randomly or otherwise 
select one of the requests for each destination for which a request was made (as illustrated 
by processing 413). These grants 414 are communicated to weighted random request 

20 generator(s) 400. 

Each request generator that did not receive a grant, then performs an additional 
request phase, including determining their requests based on a weighted random selection 
based on the occupancies of the virtual output queues and/or characteristics of the packets 
in their queues (415). In one embodiment, the additional requests may have been 

25 determined prior to receiving a grant 414. In one embodiment, all destinations remain 
available for a request in the additional phase. In one embodiment, grants 414 include an 
indication of the granted destinations, and thus these destinations can be removed from 
contention in identifying a destination for which to request during the second phase. In 
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one embodiment, the additional phase request determination excludes the destination 
requested in a prior phase (as it was granted to a different requester and is unavailable 
during the additional phase) from the weighted random selection in the additional phase. 
In one embodiment, more than one packet can be sent from an input and/or received by 
5 an output in a packet time, and therefore the number of requests and/or grants are 
accordingly increased. 

The weighted random requests 416 for the switch inputs that can send more 
packets in a packet time (e.g., their prior phase request was not granted) are 
communicated to grant generator(s) 402, which then randomly or otherwise select (417) 

10 among the received requests 416 for the destinations which can accept additional packets 
in a packet time (e.g., those destinations which did not receive/grant a request during a 
prior phase in one embodiment). The additional phase grants 418 are then communicated 
to weighted random request generator(s) 400 (or in one embodiment to the control for 
sending packets and/or the control for configuring the switch). Packets are sent according 

15 to the grants as illustrated by reference number 441 . The number of rounds of 

request/grant phases may vary based on the embodiment, and can include one, two, or 
even more than two rounds. 

FIG. 5 A is a flow diagram of a process used in one embodiment for performing 
weighted random scheduling of packets by a request generator for a particular switch 

20 input. Processing begins with process block 500, and proceeds to process block 502, 
wherein a request is generated by identifying one of the possible destinations (i.e., 
identifying the destination/switch output, virtual output queue, or other corresponding 
value) based on a weighted random selection based on packet occupancies of the virtual 
output queues and/or characteristics of the packets (e.g., number of packets or bytes, age 

25 of packets, quality of service i.e., higher priority traffic is given greater weight, etc.). As 
determined in process block 504, if the request was granted, then in process block 506, a 
packet is sent to the destination from the virtual output queue corresponding to the 
destination. Processing is complete as indicated by process block 508. 
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FIG. 5B is a flow diagram of a process used in one embodiment for determining 
grants from received requests. Processing begins with process block 520, and proceeds to 
process block 522, wherein zero or more requests are received for each particular 
destination, and one of these requests is randomly or otherwise selected for each 
5 particular destination, and a corresponding grant is communicated to the requester. In 
process block 524, the switch is configured, if required, based on the granted requests so 
the packets can be sent (e.g., in process block 506 of FIG. 5 A). Processing is complete as 
indicated by process block 526. 

FIG. 6 is a flow diagram of a process used in one embodiment for performing 

10 weighted random scheduling of packets in a packet time, with a packet time 

corresponding to the amount of time allocated to receive a packet at an input of a switch 
from an external source. Processing begins with process block 600, and proceeds to 
process block 602, wherein requests are generated for each switch input having one or 
more packets to send based on a weighted random selection. In process block 604, one of 

15 the requests for each of the requested destinations is randomly or otherwise granted. In 
process block 606, the switch is configured, if required, to accommodate the granted 
requests, and the packets are sent from the corresponding inputs to the corresponding 
outputs. As determined in process block 608, if there are more request/grant/send packet 
rounds to be performed in the packet time, processing returns to process block 602. 

20 Otherwise, processing is complete as indicated by process block 610. 

In view of the many possible embodiments to which the principles of our 
invention may be applied, it will be appreciated that the embodiments and aspects thereof 
described herein with respect to the drawings/figures are only illustrative and should not 
be taken as limiting the scope of the invention. For example and as would be apparent to 

25 one skilled in the art, many of the process block operations can be re-ordered to be 
performed before, after, or substantially concurrent with other operations. Also, many 
different forms of data structures could be used in various embodiments. The invention as 
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described herein contemplates all such embodiments as may come within the scope of the 
following claims and equivalents thereof. 
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